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We study Sutton's 'micro-canonical' model for the internal organization of firms, that leads to 
non trivial scaling properties for the statistics of growth rates. We show that the growth rates 
are asymptotically Gaussian in this model, at variance with empirical results. We also obtain 
the conditional distribution of the number and size of sub-sectors in this model. We formulate 
and solve an alternative model, based on the assumption that the sector sizes follow a power-law 
distribution. We find in this new model both anomalous scaling of the variance of growth rates and 
. non Gaussian asymptotic distributions. We give some testable predictions of the two models that 

would differentiate them further. We also discuss why the growth rate statistics at the country level 
and at the company level should be identical. 
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The annual growth rate of a company is fluctuating both across companies and from year to year. It is therefore 
tempting to study the statistics of this growth rate. It has been known for many years that the average growth rate 
is to a good approximation independent of the size of the company. This is known as Gibrat's proportionality law: 
£ — ' since the growth rate is the relative size increase of a company (where the size refers to the sales, the number of 
employees, etc.), the fact that the average growth rate is independent of the size means that on average a company 
' grows proportionally to its size. 

A very interesting question, that was only addressed recently, concerns the fluctuations of the growth rate, and 
the size dependence of these fluctuations. Quite remarkably, Stanley et al. found that the standard deviation a of 
the growth rate r decreases with the size S of the company as a(S) ~ 5~' 9 , with [3 as 0.18 Jlj,^. This power-law 
scaling holds over six decades, and can be extended to larger sizes by considering countries as 'companies' and taking 
the GNP as a measure of the size ||. More precisely, the distribution of the rescaled growth rate v = r/a(S), with 
cr(S) ~ , appears to be size independent. This rescaled distribution II(i;) is furthermore found to be non Gaussian. 

This remarkable result is puzzling because one could have naively expected that large companies (or countries 
for that matter) would aggregate different independent 'shocks' that would lead, using the central limit theorem, 
to a S^ 1 / 2 decrease of the volatility of its growth rate, which would furthermore be Gaussian for large S's. This 
however assumes that a company can be thought of as a collection of K 'sub-companies' of average size Sq and weakly 
correlated activities. In this case, K = S/So and if the shocks affecting each sector of activity have a finite second 
moment, the central limit theorem applies. 

The fact that /3 < 1/2 suggests otherwise. Obviously, if all the sectors of activity of a given company had strong 
cross-correlations, one would find the extreme result that (3 = 0. However, this is not the case: Sutton has shown some 
empirical data that support the idea that the growths of different sectors are to a good approximation uncorrelated [Q . 
This is what Sutton called the 'scaling puzzle', which lead him to propose a simple model for the internal organization 
of firms that predicts asymptotically f3 = 1/4, not very far from the empirical result Q|. 

The aim of this note is threefold. In a first part which we intend to be also of pedagogical interest, we revisit Sutton's 
model using methods from statistical physics, and obtain a number of complementary predictions of this model that 
can be compared with empirical data, in particular the distribution of rescaled growth rate II (v), which we find to 
be asymptotically Gaussian, at variance with the empirical result. Second, we introduce and study an alternative 
model where we argue that the distribution of sizes of the sub-sectors is a power-law, and derive analytically the 
value of j3 and the shape of II(w), which in some regime is found to be strongly non-Gaussian. We then compare our 
results to the findings of Stanley et al. and discuss the plausibility of our alternative model. Finally, we discuss the 
interesting fact that GNP growth and company growth behave similarly. This means that the microeconomical and 
macroeconomical levels are strongly interconnected. We show that our model is indeed stable upon aggregation. 
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II. SUTTON'S MODEL 



We first recall Sutton's model. In the absence of more information, Sutton postulates that all partitions of a 
company of size S in smaller sub-pieces are equiprobable [Q. This is a kind of 'microcanonical', minimum information 
assumption, similar to the corresponding hypothesis in statistical physics where all microstates are equiprobable. For 
physical systems, this is justified by the Liouville theorem that is itself a consequence of Hamiltonian dynamics; it 
would be interesting to find an analogue of this theorem for the (stochastic) dynamics underlying the organization of 
firms. 

More precisely, Sutton assumes that S is a large integer, and uses known mathematical results on the number of 
partitions, to compute cr(S). Let us show how his results can be recovered directly. For this, we introduce the following 
quantity!!! 



oooo oo / K \ r k / k \ 

Af(R,K,S)= E ••• E 5 [ S -H s Aj H p (vMm5 U-E s ^ 

si = lS2=si s K =s K -i \ 1=1 / 1=1 \ i=l / 



(1) 



This quantity counts the number of partitions of the integer S in exactly K integers s\, S2, ■ ■ ■ , sk, and such that 
the total absolute growth rate R is given by the sum of independent random variables r\i (that we suppose of finite 
variance), each weighted by the size Si of the sub-sector. This assumes a proportionality effect at the sub-sector level. 
It will be convenient to introduce the Fourier-Laplace transform of this quantity (or generating function), defined as: 

oo oo poo 

A/Xq, ^, A) = E E / dRexp[iqR- vK - \S]Af{R,K,S). (2) 
s=i k=i 

The quantity J\{q = 0, v = 0, A) is therefore the Laplace transform of the total number of partitions, and is given by: 

oo oo oo oo / K \ 

Af(q = 0,v = 0,\) = J2I2lL,--- E exp -A^> , (3) 

K=l si = l s 2 =si s K =s K -l \ i=l / 

and be computed explicitly as: 

oo —\K 

For A — > 0, the sum over K can be approximated by an integral: 

N{q = 0, n = 0, A) w jf difexp^-AA' - jf dx ln(l - e~ Xx )j . (5) 

Now the integral over K can be estimated using a saddle-point method. The saddle point K* obeys the following 
equation: 



which for small A gives: 



Plugging this result in Eq. (|5|) leads to: 



A = -ln(l-e- AA ), (6) 



A (</ n. ,/ = u. A ID - < xp ( j J dv\n(l - e~ v )j = exp (j^j , (8) 



1 In the following equation, 5 refers to the Dirac delta for continuous variables and to the Kronecker delta for discrete variables. 
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where we have neglected preexponential corrections, that can also be computed. Now, it is easy to check that the 
inverse Laplace transform of Jx(q = 0, v = 0, A — > 0) behaves, for large S, as: 



(9) 



which is the Hardy- Ramanuj an result at large S ||. In the course of the calculation, one also discovers that, as far as 
scaling is concerned, A ~ S 1 ^ 1 / 2 . One could extend the computation to get the exact prefactor, equal to (A\/ZS)~ X . 
One can easily extend the computation to v =^ 0. The saddle point is now at: 

ir«il n — !— . (10) 

Now, setting v = x\/\ In A|, one finds, in the limit A — > 0, 

A/~(g = 0,:rA/|lnA|,A) _ _ x 
Af{q = 0,0, A) 



(11) 



Having noted that e 1 is the Laplace transform of S(u — 1), we conclude that when S — ► oo, the variable K/VShiS 
tends to unity with probability one. One can also study how the fluctuations behave for large S. Setting K = 
one finds that the Laplace transform P(z) of the distribution P(y) of the random variable y reads, 

for S — » oo: 

p(z) = J dye~ zy P(y) = exp [-z + (l + z) ln(l + z)] , (12) 

which shows that the distribution of y is non Gaussian, even in the limit S — > oo. For example, the skewness of P(y) 
is found to be equal to —1. In summary, we find that the average number of 'sub-entities' is equal to Veins', with 
relative (non Gaussian) fluctuations which go to zero as 1/lnS. The average size of a sub-piece is clearly equal to 
VS/\nS. 

Therefore, the most probable partition of a large integer S is to break it in y/S parts of size y/S (neglecting 
logarithms) ||. In fact, as we now show, this is not really correct. A better description is to say that one has y/S 
pieces of size 1, VS/2 pieces of size 2, ... and one piece of size y/S. More precisely, what is the average number of 
occurrences N(s\S) of a piece of size s, given the total size 5 1 ? A little reflection tells us that this is given by: 

oo oo 

N(s\S) = JV(S)(0(s\S)) = ^2JV(S -ks) = ^2 kQ(S - ks), (13) 

fe=i fc=i 

where Q is the probability of occurrence that the number s appears exactly k times in the partition, defined by the 
above equation. For 1 <C s <C S, N(s\S) can be approximated by: 

N(s\S)nJV(S)Ye X p(— • (14) 

One therefore finds the following interesting result: the size distribution of sub-sectors follows, in Sutton's model, a 
Bose-Einstein distribution. This distribution behaves as a power law 1/s, for s -C yS and decays exponentially fast 
for s > y/S. This is a directly testable prediction of Sutton's model. One can furthermore check directly that: 

i^^m «r d « =s , (15) 



Af(S) b 2 J e u -l 

as it should. 

As noticed by Sutton, the quantity AT(slS') is interesting because it allows us to compute the variance cr^S) of the 
absolute growth rate R, defined as: 



a^S) - (R 2 \S), (16) 

where the brackets means an average over the random growth rates r\i and the overline is an average over all partitions 
of S. Using the fact that the rji's are independent and of variance equal to ct 2 , one has: 
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(R 2 \S) = <j 2 o 



2 V 



2 5 / 2 3 3 / 2 C(3) 



exp(^)-l 



a 2 S 3/2 = 1.13955.... ct 2 ^ 3 / 2 . 



(17) 



This is Sutton's result: the conditional variance of the absolute return grows as S 3 ^ 2 , therefore the variance of the 
relative return r = R/S decays as S 1-1 / 2 , which is equivalent to the statement that /3 = 1/4 0]. In intuitive terms, the 
total absolute return is the random sum of ~ y/S different terms, all of order ~ y/S, which gives a random number of 
order VsW^ ~ S" 3/4 . This rough (and slightly incorrect) argument actually suggests that the absolute growth rate 
is the sum of a large number (yS) of random variables, and therefore should be Gaussian for large S. We can show 
this more precisely by computing the kurtosis k of i?, defined as: 



(R 4 \S) 



(R 2 \S) 

We assume that the individual growth rates 77 have a finite kurtosis given by kq. Therefore: 



(3 + K )$>? + 3$>? a 



S * + 3 J2 S i S . 



The first term is easy to compute using N(s\S) and one finds: 



E*i=E S *^W - ^f^C(5) S*' 2 = 7.17114.... 



(18) 



(19) 



(20) 



The second term is more subtle since one needs to know the correlation of the number of occurrences of two integers 
s,s' involved in the partition of S, (0(s\S)0(s'\S)). This quantity can be obtained similarly to A r (s|S'). For s = s', 
one has: 



Af(S)(0(s\S) 2 ) ee J2 k 2 Q{S - ks) = ]T(2fc - l)Af(S - ks). 



k=l 



k=l 



which can be again be approximated as 



(0( 



bks \ e M^) + 1 



v 



Therefore: 

s 



£V [(0(s\S) 2 ) - (0(s\S)Y< 



s=l 



duu 



(e* - l) 2 



29/233/2 
5tt 



55/2 = 7.48509.... 5 5 / 5 



The terms with s 7^ s' can be computed from: 



bks 



b(s 



8S 3 / 2 



(ks + ls') 2 + 



(21) 



(22) 



(23) 



(24) 



The reason we took one extra term in the above expansion is that (0(s\S)0(s'\S)) — (0(s\S)} (0(s'\S)} is zero to first 
order. The non zero correlation comes from the term k£ss' in the above expression. To lowest order, one finally finds, 



s ol9/2o5/2 

V s 2 s' 2 \{0{s\S)0{s'\S)) - (0(s\S))(0(s'\S))} » ^[ C (3)] 2 S 3 / 2 = -53.2953...S 3 / 2 . 



(25) 



This contribution is a factor S smaller than the other two contributions, but has a rather large prefactor. To leading 
order in S, the final result reads: 
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k Ps -L [5. 52232. „k + 5.76408...] (26) 

The conclusion is that if the growth rates of the sub-sectors are non Gaussian, the kurtosis of the aggregate growth 
rate decreases as 1/y/S for large S. This is expected on general grounds, since we have seen above that the number 
of independent sub-entities is of order y/S, and the kurtosis of a sum decreases as the inverse of the number of 
independent terms in a sum. The second contribution comes from the fluctuations of the numbers of terms in the 
sum. 

Therefore, asymptotically, the rescaled aggregate growth rate rS 1 ^ 4 is found to be Gaussian in Sutton's model, at 
variance with empirical findings. However, for finite 5, there are important corrections to this asymptotic result: 
suppose that the initial kurtosis of rj is equal to 3, which is the case when r\ is distributed according to a symmetric 
exponential. Take a reasonable value S = 100. Then the residual kurtosis of the growth rate is still quite large, 
~ 2.2. Hence, significant deviations from a Gaussian distribution may be observed in reality, but should diminish as 



S becomes large. We shall come back to this issue in section [V 



III. AN ALTERNATIVE MODEL 
A. Definition of the model 

We now discuss another model, where we assume that companies are formed by aggregating entities that have a 
certain a priori distribution of sizes, that we choose to be a power-law. The motivation for this is two-fold. First, the 
distribution of company sizes in a country is known to be a Pareto (power-law) distribution. Since the scaling law 
for the variance of the growth rate also seems to hold at the country level, one could indeed argue that the actual 
distribution of company sizes should play a role. Second, there is a quite general and plausible dynamical model that 
leads to a power-law distribution of sizes. Assume, as in Sutton's model, that each sub-entity in a company has a 
random growth rate. The role of the business management is, to a certain extent, to redistribute the income of each 
sector of activity such as to help the less performing ones to catch up. Therefore, a reasonable dynamical model for 
the size Si(t) of a given sub-entity is: 



Vi(t)si(t), (27) 



where the first two terms describe redistribution of resources among the sub-entities, and the last term the random 
growth rate. The parameter 7 measures the strength of the redistribution policy. It can be shown that the stationary 
distribution for such a stochastic process has a power law tail, p(s) ~ s _1_A1 , with /j, = 1 + j/cr^. (See the detailed 
discussion and generalization in ||, and also |l^,|ll| for alternative models.) 

Hence, we assume that the a priori distribution of the size of sub-entities has a power law tail: 

K*)«|rjr (« -oo), (28) 

We also assume that a company is composed of an arbitrary number K of such sub-entities, with a certain a priori 
weight Q(K). This means that if one chooses randomly a company in a country, there is a probability proportional 
to Q(K) for this company to contain exactly K sectors. We will see below that Q{K) can be inferred from empirical 
data. The unnormalized distribution of growth rates for a given company size S reads, in this new model: 

00 K / K \ K / K \ 

K=l »=1 V <=!/"' i=l V i=l / 



B. Distribution of company sizes 

Let us first establish some results on the size distribution of companies M{S) = J dRAf(R,S), a quantity much 
studied in a different context in p^ |. This will also enable us to relate Q(K) to this empirically observable quantity. 
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We first study the case the simplest case Q(K)=1. The following results are obtained using Laplace transforms, as 
above. We write: 



#(?,A) = 



dS 



dR exp [iqR - XS] Af(R, S) = ^ 

K=l 



dsdrjp(s)P(rj) e 



iqsrj—Xs 



K 



For simplicity, we assume that P{rf) is Gaussian with unit variance, and introduce the quantity g(q, A) as: 



in terms of which one finally has: 



g(q,X) = / dsp{s) 



1 _ e -q 2 s 2 /2-Xi 



gig, A)' 



(30) 



(31) 



(32) 



All the following asymptotic results will only depend on the behaviour of g(q, A) in the limit q, A — ► 0. If we first 
study the case q = from which Af(S) is deduced, one finds that one has to distinguish the cases /i < 1 and /i > 1 
p2| . The small A behaviour of g is found to be: 

fl(« = O l A)«r(l-/i)(*oA)" (m<i); j(? = 0,A)«A(.) (m>1). (33) 
For /i > 1, the average size of a sub-entity is finite and equal to (s). Inverting the Laplace transform then leads to: 

1 



N(S) 



(34) 



for /i > 1, whereas for /i < 1, one has: 



sin7r/i 1 



so V S 



(?) 



(35) 



The case \i = 1 is special and involves logarithmic corrections. Intuitively, the difference of behaviour comes from 
the fact that when \x > 1, the typical number of sub-entities behaves as K ~ S/(s), whereas when /i < 1, a single 
sub-entity represents a sizeable fraction of the whole company and K ~ 5 P <C 5 . 

Assuming now that Q(K) decays as a power-law Q{K) ~ if _1_Q and that > 1, we find, using the same method, 
the following result for P(S): 



i 



5' 1 



(a < M ); AT(S) 



1 



(a > /i). 



(36) 



The case a> corresponds to a situation where large companies only contain a small number of sectors (see below) . 
This is not very plausible; furthermore, this would lead to a variance of the growth rate R that grows proportionally 
to size S, i.e. (3 = 0, which is not compatible with empirical data. Therefore we will assume in the following a < /i. 
In this case there is a direct relation between the tail of Q(K) and the tail of the size distribution of companies. 
Empirically, a is found to be close to unity: a w 1.05 fHf ]. 

C. Fluctuations of the growth rate 

We now turn to the prediction of this model for the growth rate fluctuations. One need to consider three cases: 
/i > 2, 1 < fi < 2, n < 1. The case 1 < fi < 2 is, as we will show, the interesting one. In the relevant situation where 
a < fx, one can show that the value of (3 and the shape of the rescaled distributions are independent of the value of 
a, and we choose in the following, for simplicity Q(K ) to be constant. 

We now need to study M(q, A) with q ^ 0, that gives access to the distribution of the growth rate. This case can be 
treated by identifying the correct scaling region in the q, A plane, which means, in concrete terms, the scaling relation 
between R and S. For example, when fi > 2, one expects the Central Limit Theorem to hold, suggesting R ~ y/S. 
So we set q = 9\J~X, and take the limit A — > 0. If fi > 2, one finds: 



9(q, A) « A 



+ 



(37) 



G 



where the • • • refers to higher order terms in A, the precise form of which depend on the value of fi. Therefore, in this 
regime, 



(40) 



Af(q, A) sa — m . (38) 

\[(a) + %{8*)] 

Now, we introduce the probability P(R\S) to observe a certain growth R given S. Then, by definition, 

poo poo 

Af(q,X)= dS dRexp[iqR~- \S\AT(S)P(R\S). (39) 

JO J -oo 

Assuming that P(R\S) = S-^ 2 H(RS-^ 2 ) and using the above result for Af(S) leads to: 

poo POO 

J\f(q = 6V~X, A) » -j-r / du I duexp [iQv^fu - u] H(v), 
A(s) J J_ oc 

where we have set AS* = u and R = vyS. It is now easy to see that Eqs. ©, © are satisfied if: 

n(v) = -JL= exp(-^) a 2 = { -fl. (41) 
V27rer 2 2cr^ (s) 

Therefore, in the case [i > 2, the variance of the relative growth rate decreases as S" -1 / 2 (i.e. (3 = 1/2), and the 
distribution of growth rates is Gaussian. 

More interesting is the case 1 < fi < 2. It turns out that in this regime, the correct scaling is P(R\S) = 
g-i/Mn^S*- 1 /^) and q = OX 1 /^. In this regime, one now has, for small A: 

g{q,\)tt\[{s)+n\s 6\n{ii)]+---, (42) 

where 

JO*) = [ ^(1 - e-* 2 / 2 ) = -2-^/ 2 r(^). (43) 

Now, it is easy to show that the scaling function II(u) is now precisely a symmetric Levy stable distribution of index 
/i, L^v). This comes from the fact that the Fourier transform of L^(v) gives exp(— j4u|0| m ), where A is a constant, 
so that the integral over u in Eq. (]!(]) now reproduces Eq. ( {42; ) . 

However, this is not the whole story. The reason is that a direct computation of the variance of R (from the 
derivative of g(q, A) with respect to q 2 at q = 0) leads an apparently contradictory scaling, since: 

(R 2 \S) oc S 3 -' 1 (44) 

instead of S 2 ^ 1 as one might have naively expected from the scaling form of P(R\S). One should now remember that 
Levy stable distributions L^(v) with fi < 2 have tails decaying as w and thus a formally infinite variance. This 
means that (R 2 \S) is actually dominated by the region where R is of order S, such that indeed: 

(R 2 \S)*s£* (45) 

Therefore, the Levy stable distribution only holds in the scaling region R - S 1 ^. For R - S > S 1 ^ the distribution 
is truncated. When fi — ► 1, the truncation 'invades' the scaling regime, and the result becomes again different for 
[i < 1, see below. 

The conclusion of this analysis is that the variance of the relative growth rate r = R/S scales in this regime with 
an exponent (3 — (/z — l)/2, that interpolates between the standard value /3 = l/2for^ = 2 and (3 = for fi = 1 
(although these marginal cases are affected by logarithmic corrections). However, the surprising result is that in this 
regime the distribution of R does not re-scale as a function of rS^ 3 but rather as rS^^ 1 '' 1 * B We will discuss this in 
relation with empirical results in the next section. 



2 For other situations where this 'anomalous scaling' occur, see 
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When /i < 1, it is easy to show that now R ~ S, i.e. (3 = 0, which disagrees with empirical results. Furthermore, 
the result one finds for the scaling function II is no longer universal. When /x > 1, the scaling function was universal 
in the sense that its shape only relied on the finiteness of the variance of rj. When \i < 1, on the other hand, only 
a finite number of terms (sub-entities) contribute to the sum i?, and one cannot expect a Central Limit Theorem to 
hold. When P{rj) is Gaussian of variance a 2 , all moments of P{R\S) can be computed using the method of fl5|| . One 
finds for example: 



(R 2 \S) = 



1-M 
1 + // 



,-r 2 Q' 2 



(46) 



and: 



(R 4 \S) = 3 



(3-^)(2- M )(l- M ) + 2 M (l- / i) 2 
(3 + M )(2 + /i)(l + /x) 



cr ,S 



(47) 



Numerically, we have found that P(R\S) could be rather well fitted by a 'stretched Gaussian' form, exp— (R/S) a 
with a < 2. For example, for fi = 1/2, we found a ~ 4/3. This cannot be exact, however, since the exact kurtosis is 
found to be 1.37143, whereas the kurtosis of the stretched Gaussian with a = 4/3 is 1.22219. Note that (R^IS) would 
have a different value if P(j]) was non Gaussian: this shows that for /i < 1 the distribution Tl(v) is non universal. 



D. Conditional distribution of sector sizes 



Finally, one can also compute in this model the conditional distribution of sector sizes, -P(s|S f ), which depends on 
the value of a. When a < /i, we find that P(slS') is the sum of two contributions: one power-law regime s _1 ~ M for 
s <C S which reflects the a priori distribution of sector sizes, and a small 'hump' for s ~ S of height which vanishes 
for large S: 

P(*|S)«ig|, ( S «5); P( s |S)«|gM ( 8 „s), (48) 

where F(.) is a certain scaling function of order unity, that vanishes for s > S. For /j, < a, one the other hand, the 
hump survives when S — > oo whereas the power-law regime disappears. In other words, when a is larger than /i, the 
typical number of sectors K* tends to be small and the typical size of the sectors is of the order of S itself. 



E. Stability upon aggregation 



As mentioned in the introduction, the scaling of GNP growth rates is empirically found to be very similar to the 
scaling of company growth ||. In this respect, it is worth noting that Sutton's construction is not stable upon 
aggregation: aggregating companies characterized by an exponent /3 = 1/4 using Sutton's prescription at the country 
level, leads to an exponent /3 = 3/8. In our model, on the other hand, stability upon aggregation is by construction 
satisfied. The argument is very simple, and relies on the fact that the results are independent of the value of the 
company size exponent a, provided a < /i. The idea is to consider the GNP itself as the sum of independent sectors, 
i.e. to remove the 'shells' that define companies, which are an intermediate level of clustering. A country is therefore 
in this description a 'super-company' with many sectors. The sectors are the same than previously, so they have the 
very same Pareto tail of exponent \x for their size distribution. Now we just have to assume that there is a given 
distribution Q'(K) that describes the distribution of the number of independent sectors in different countries. If 
Q'(K) has a Pareto tail with exponent a' with a' < fi, we can repeat the above arguments and find the same value 
for the exponent (3 = (/i— l)/2 at the country level. 



IV. DISCUSSION - COMPARISON WITH EMPIRICAL DATA 



We have shown how several interesting asymptotic predictions of Sutton's model could be derived. Apart from 
Sutton's central result, namely that the root mean square of the growth rate decreases with the company size S as 
(i.e. /3 — 1/4), we have shown that the distribution of growth rate should be asymptotically Gaussian, with a 
kurtosis that decays as S~ 1 / 2 . The first result is, as noticed by Sutton, in rather good agreement with the empirical 
results of Stanley et al. [p], although the value of the exponent (3 is closer to 0.18. The second result is however 
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v=r S 

FIG. 1. Distribution 11(d) of the rescaled returns for v > and two values of S: S ~ 500 and S « 5000, and for fi = 1.3. A 
simple exponential form, as suggested in is shown for comparison. Note that for this range of S, the distributions do not 
re-scale, except in the 'central' region. For large values of S, the distribution should converge towards a Levy distribution of 
index [i = 1.3: one can clearly see the tails getting fatter as S increases. 

problematic, since in this model one should find a rescaled distribution of growth rates that progressively deforms 
with S as to become Gaussian for very large S, whereas the data indicates that the rescaled distribution is actually to 
a good approximation independent of S and non Gaussian. A closer look at the data of Stanley et al. in fact suggests 
that non Gaussian tails are more pronounced for larger companies . 

We have then explored an alternative to Sutton's model, where the size of the 'sub-entities' is postulated to be 
a power-law (Pareto) with an exponent /i. This is motivated by the ubiquitous observation of Pareto distributions 
for company sizes, and by a simple dynamical model that indeed leads to a stationary power-law distribution of 
sizes. In this respect, it is not obvious how one would write a natural dynamics for sector growth that leads to 
Sutton's 'microcanonical' ensemble where all partitions are equiprobable. As a function of /i, we have found three 
qualitatively different regimes. In particular, when 1 < /i < 2, we find that /3 = (/J, — l)/2. The empirical value 
(3 = 0.18 corresponds to fi = 1.36, which is indeed larger than the value of a w 1.05 reported for firm sizes in as 
required for the consistency of our analysis. Our model then predicts an S independent distribution for the growth 
rate multiplied by S^^'^ (and not by S@), which is a symmetric Levy stable distribution. Note however that these 
are asymptotic results that require S^ -1 )/^ 1 ^> 1, such that the scaling region is not affected by truncation effects 
(see the discussion after Eq. [li]). For finite S and \i close to one, one expects strong finite size effects, and a very slow 
convergence towards the asymptotic value. This is why numerical simulations are needed to explore the moderate 
S regime. We show in Fig. 1 the distribution of rescaled returns obtained from a numerical simulation for /i = 1.3, 
a Gaussian P(rj) and for S ~ 500 and S — 5000. Notice H(v) can be very roughly approximated by a symmetric 
exponential for small enough S: H(v) = exp(—\v\/v ), as suggested by the empirical data. The systematic deviations 
from this form both at small values of v and at large v are qualitatively similar to the ones observed empirically (see 
||): H(v) is actually parabolic for small values of v and decays slower than exponentially at large v. One also observes 
strong finite size effects: as S increases, the tail of the distribution becomes fatter and fatter. This is expected since 
asymptotically this distribution should converge to a Levy distribution with a power-law tail, which is indeed fatter 
than an exponential. Note that we expect only qualitative agreement with empirical data, since the assumption that 
P(r}) is Gaussian at the sector level is probably incorrect and does influence the detailed shape of H(v) for finite S. 
However, as mentioned above, the systematic 'fattening' of the tails as S becomes larger seems to be present in the 
empirical results of Stanley et al. 0. 

It would be extremely interesting to obtain direct empirical information on the conditional distribution of the size 
s and total number K of the sub-entities for a fixed S. We have seen that Sutton's model predicts a Bose-Einstein 
distribution for s, that behaves as 1/s for s <§; y/S, and beyond which it falls rapidly, whereas K becomes peaked 
around the value y/~ShiS. In our model, on the other hand, the conditional distribution of s is, as soon as /i > a 
and for s <C S, identical to the a priori distribution p(s) ~ s -1 ~ M , and the total number K peaks around the value 
S/(s). Therefore, a tangible difference between the two models is that the power-law regime has an exponent 1 in 
the Sutton model and the size of the sectors rarely exceeds \/S, whereas the the distribution is a power-law with 
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exponent 1 + /z w 2.35 up to S in our model (with possibly a small hump for s ~ 5, see Eq. (fi"8|)). We hope that 
these falsifiable predictions of the two descriptions, as well as the quantitative description of the rescaled distribution 
of growth rates given above, will motivate further empirical and theoretical research, and help elucidate the 'scaling 
puzzle' of company growth. 

NOTE ADDED: 

While completing this work, X. Gabaix sent us a very interesting preprint where related arguments (although in 
details quite different from ours) are discussed. See: X. Gabaix, Power-laws and the origin of the business cycle, 
working paper, October 2002. 
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